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ABSTRACT. The paper discusses properties of a DATALOG^-Afe query language 4QL, originally 
outlined by Maluszyriski and Szalas [MS11]. 4QL allows one to use rules with negation in heads 
and bodies of rules. It is based on a simple and intuitive semantics and provides uniform tools for 
"lightweight" versions of known forms of nonmonotonic reasoning. Negated literals in heads of rules 
may naturally lead to inconsistencies. On the other hand, rules do not have to attach meaning to 
some literals. Therefore 4QL is founded on a four-valued semantics, employing the logic introduced 
in [MSV08, VMS09] with truth values: 'true', false', 'inconsistent' and 'unknown'. In addition, 4QL 
is tractable w.r.t. data complexity and captures PTlME queries. Even though DATALOG^ is known as 
a concept for the last 30 years, to our best knowledge no existing approach enjoys these properties. 
In the current paper we: 

— investigate properties of well-supported models of 4QL 

— prove the correctness of the algorithm for computing well-supported models 

— show that 4QL has PTlME data complexity and captures PTlME. 

KEYWORDS: query language, data complexity, paraconsistent semantics, DATALOG^ 



1 . Introduction and Preliminaries 

This is a companion paper to our conference paper [MSI 1]. It is motivated by the prob- 
lem of handling explicit negative knowledge in a rule language. This means that negation 
can appear in bodies and heads of rules, as done in DATALOG -1 ^ (see, e.g., [AHV96]). How- 
ever, in contrast to the traditional query languages based on nonmonotonic logics initially 
derived from the Closed World Assumption [Rei78, AD98] (CWA) we focus our attention 
on explicit negative knowledge. In [MSI 1] we outlined a novel lightweight approach to this 
problem, separating the issues of incomplete/inconsistent knowledge and nonmonotonicity. 
To make separation between monotonic-nonmonotonic clear, we first follow the Open World 
Assumption (OWA) in the presence of the explicit negation and later provide simple con- 
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structs allowing one to structure knowledge bases in the form of modules and to query such 
modules. As shown in [MS11], these constructs provide a natural way to express CWA and 
"lightweight" versions of other known forms of nonmonotonic reasoning. 

These methodological assumptions led to a DATALOG^-like query language, 4QL, foun- 
ded on a four-valued semantics. The semantics employs the logic introduced in [MSV08, 
VMS09] with truth values: 'true', 'false', 'inconsistent' and 'unknown', further denoted by 
t, f, t and u, respectively. 

For the four-valued query language 4QL proposed in [MS11] we defined a semantics 
based on the notion of well-supported models and we proposed an algorithm for computing 
such models. The language: 

- allows one to use rules with unrestricted negation (in heads and bodies) 

- has a simple and intuitive semantics 

- allows for "lightweight" versions of known forms of nonmonotonic reasoning. 

Due to page limits, [MSI 1] does not contain important properties of well-supported models, 
nor proofs of theorems and complexity analysis. In the current paper we therefore: 

- investigate properties of well-supported models of 4QL 

- prove the correctness of the algorithm for computing well-supported models 

- show that 4QL has PTlME data complexity and captures PTlME. 

Even though Datalog -1 ^ is known as a concept for 30 years, to our best knowledge no 
existing approach enjoys these properties. 

In Section 2 we discuss related work. Next, in Section 3, we provide syntax and semantics 
of 4QL via the notion of well-supported models and continue in Section 4 with properties of 
well-supported models. Section 5 is devoted to an algorithm for computing well-supported 
models and a proof of its correctness. In Section 6 we extend basic 4QL by very simple 
constructs which allow us to show in Section 7 that the extended 4QL captures PTlME on 
ordered databases. Finally, Section 8 concludes the paper. 

2. Related Work 

While Datalog -1 has attracted much attention (see, e.g., [AHV96] and references therein), 
a tractable and at the same time intuitive semantics for Datalog^ -1 is not reported in the lit- 
erature. The problem is often addressed by taking Datalog with negation or extended logic 
programs [GL91] as the starting point for paraconsistent extensions, see e.g. [ADP05, dAP07, 
Ari02, DP98, SI95]. 1 

Let us now discuss approaches [ADP05, Ari02, dAP07, Fit02] which are closest to ours. 

The paper [ADP05] provides a framework for paraconsistent logic programming gener- 
alizing some previous work of its authors to an arbitrary complete bilattice of truth-values, 
where belief and doubt are explicitly represented. In contrast to our approach, where default 



1. There is a rich literature on paraconsistent logics (see, e.g., [BCG07] and references therein, in par- 
ticular [CMM07]). 
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negation is not used, the focus of this work is semantical integration of explicit and default 
negation. 

The paper [Ari02] proposes a framework for dealing with logic programs with two kinds 
of negation: an explicit negation and a negation-by-failure. We use a single negation, with- 
out referring to provability, as the negation-as-failure does. We find this an advantage of 
our approach. Also our implication is different. As [Ari02] concentrates on logic programs, 
database related topics are not considered there. In particular no complexity results concern- 
ing finite domains are provided. 

Another approach, P-Datalog [dAP07], provides a paraconsistent language for knowledge 
base integration based on a four-valued logic with the truth ordering which coincides with 
our truth ordering. However, there are several important differences. First, our language only 
allows explicit negation (in the head and in the bodies of the rules) while P-Datalog programs 
only allow negation by default ~ in the rule bodies and use the CWA. Information integration 
done in P-Datalog uses the truth ordering, while we provide a more general constructs for 
achieving this goal (extended literals, defined in Section 6). Finally, the rules are interpreted 
by substantially different implications. 

A four-valued semantics for logic programs with negation was also proposed by Fitting 
(see e.g. [Fit02]). The major difference is that Fitting's semantics is based on the Belnap's 
logic. This contrasts with our approach since we use a different truth ordering. Second, 
the semantics of Fitting allows one to derive conclusions from false premises, which we do 
not allow. Unlike in our framework, a rule of a Fitting program is satisfied if and only if 
truth values assigned to the head and to the body are equal. Last and not least, the language 
considered by Fitting does not admit negation in rule heads. 

A major difficulty in defining 4QL was handling of disjunction in rule bodies. This paper 
presents a solution to this problem. A preliminary idea of a four-valued rule language not 
admitting disjunction in rule bodies appeared in our previous work [VMS09]. 

Also in the field of the Semantic Web there have been proposals addressing the problem of 
inconsistent data by defining paraconsistent extensions of Description Logics underlying the 
Semantic Web Ontology Language OWL2, see, e.g., [MS96, Str97, OW08, MH09, MailO] 
(where the issue of nonmonotonicity is not addressed) or by providing rule languages based 
on defeasible reasoning [Nut94]. 



3 . Syntax and semantics of 4QL 
3.1. Syntax of 4QL 

For a negative literal I the notation denotes its positive counterpart. We treat proposi- 
tions as zero-argument relation symbols. 

In the sequel, for simplicity, we consider ground rules only and assume that for each head 
I there is only one rule having the form: 

I :- (6u, . . . , 6 Ul ) V (6 2 i, • ■ • , b 2i2 ) V ... V (b ml , b mim ). (1) 
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The rationale for this syntax is as follows. In two-valued logic conjunction of the classical 
implications of the form a — > £ and & — £ is logically equivalent to the implication a V b — > I. 
Thus, multiple Datalog rules with the same rule heads express the disjunction of bodies of 
these rules. In logic programming this is sometimes used to avoid explicit disjunction in rule 
bodies. In our four-valued logic the equivalence does not hold (see Remark 4). Therefore, in 
order to have disjunction in the rule body we have to express it explicitly therein. 

In 4QL one can consider admission of multiple rules with the same rule heads, inter- 
preted as four-valued conjunction of such rules. However, intuitive understanding of such 
four-valued rules is rather difficult. Intuitively we consider a rule with head £ as a separate 
information source concerning £. We prefer to leave to the user how information from differ- 
ent sources is to be combined rather than to impose four-valued conjunction as the only way 
of combination. Therefore at the basic level of 4QL we do not admit multiple rules with the 
same head. In the sequel we extend 4QL with a flexible mechanism for fusing information 
from different sources. 

The empty body of a rule is denoted by 0. We sometimes refer to facts as to rules of the 
foxm£ :-0. 

DEFINITION 1 . — Let g be a rule of the form (1). Then: 

- rule(£) = g 

- head(g) d = i 

- body{g) d = (&n, . . . V (b 2 i, ■ ■ -,b 2l2 ) V ... V (b ml , . . -,b mim ) 

- fori < j < m, /3 j (g) = b jU . . . , b ji} . 

Writing rule(l), we always assume that the respective rule with £ as its head indeed exists. 

REMARK 2. — In the paper we present the case of ground rules only. However, typical rules 
with variables are allowed, too. We assume that whenever there is a variable appearing in the 
body of a rule but not in its head then it is assumed to be existentially quantified in its body. 
For example, 

P(x,y) :-q(x,y,z). 

is understood as "p(x, y) :- 3z[q(x, y, z)]". The existential quantifier is then understood as 
the disjunction q(x, y, Oi) V ... V q(x 7 y, a^), where a±, . . . , a k are all constants appearing in 
the database. □ 



3.2. The Underlying Logic 

Addressing the semantics of DATALOG^ we have made certain methodological choices. 
First of all, we decided to start with a fully monotonic query language. Therefore, rather than 
starting with CWA, as most approaches do, we have accepted OWA. This, in turn, naturally 
introduces u as a truth value. For example, having just one rule p :- qwe do not force p and 
q to be f, so have to assign them the value u. Next, since we allow negated literals in heads 
of rules, certain conclusions may contradict each other. This also gives rise to a non-classical 
truth value. Could this value be u? If one does not want to distinguish between the lack of 
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information and inconsistency, such an identification could do the job (see, e.g., [dACM02, 
CMdAOO, DLSS06, DMS06, Fit02, Fit85, NS10]). However, we find it more natural and 
informative to distinguish between u and i. We then adopt the four-valued logic introduced 
in [MSV08] as the semantical foundation. The truth tables for the connectives are shown in 
Table 1. 



Table 1. Truth tables for A, V, — > and 
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Notice that our logic is different from the commonly used four- valued Belnap's logic f Bel77] . 
In Belnap's logic two orderings on truth values are considered, known as knowledge ordering 
and truth ordering. As shown, e.g., in [Dub08] as well as in our previous work [VMS09, 
NS10], Belnap's truth ordering is problematic in areas we focus on and we introduced instead 
a different truth ordering f < u < i < t, which was also independently proposed in [dAP07]. 
The truth tables for conjunction A and disjunction V are respectively defined as minimum and 
maximum w.r.t. our truth ordering. 

Remark 3 . — To motivate our truth ordering, where in particular u < I, first note that i 
contains an evidence that a given proposition is t so in this respect it is "closer" to truth than 
u. On the other hand, one can argue that t reflects some evidence that the corresponding 
assertion is not true, while u does not. Observe however, that in the monotonic layer we 
do not want to derive conclusions from unknown premises, since such derivations lead to 
nonmonotonicity. In contrary, we want to derive conclusions from inconsistent premises to 
"correct" heads of rules. To illustrate this point consider the following rule: 

reduce Jtemperature :- high_temperature. 

If, in a given interpretation, high Jtemperature is u then one does not want to be forced to 
derive any conclusions whether to reduce temperature or not, unless nonmonotonic reasoning 
is used (which is addressed in Section 6). 

Consider now a situation, when at some point high_temperature is t. Then we also want 
to conclude that reduce_temperature is t. If the situation changes and highjemperature 
becomes (, e.g., by deriving new facts then the previous conclusion as to the truth of 
i 'educe Jtemperature is no longer justified and is to be adjusted to i 

The above case indicates that i behaves like t (so can be considered as a designated value), 
while u does not. This motivates our choice as to u < t. □ 

The implication — > is a four-valued extension of the classical implication. While its ar- 
guments range over the four logical values the truth value of the implication is t or f. The 
implication is used to interpret clauses of 4QL. Whenever the body of a clause has the value 
f or u, the truth value of the clause is defined to be t. Intuitively, this reflects our intention 
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not to draw conclusions from false or unknown information. Namely, a clause with unknown 
or false body is always satisfied, so one does not have to update its head. From inconsistent 
body we want to conclude that the head is also inconsistent. Thus, for the predecessor with 
value i the implication is t if the successor is i, and f otherwise. The implication is t if the 
predecessor takes value t and the successor is t or t. The latter case is needed to handle the 
situation when both head and its negation are to be derived on the basis of true assumptions. 

Remark 4. — Note that the classical equivalence 

[(p g) A (r -> q)] = [(p V r) q], 

allowing one to consider several rules as a single one, does not hold in the four-valued setting 
we deal with. For example, (1 — > t) A (t — > t) is t, while (t Vt) — »• t is f. 

Observe also that that the negation defined in Table 1 is not truth reversing (t > u but not 
-i t < -ill). Our definition of -i reflects the intuition that when we have no information as to 
the truth or falsity of p then we still have no such information about -ip. This usually is not 

dcf 

questioned in the three-valued case without 1, where = u. Similarly, having inconsistent 
p, we still have inconsistent ->p. In order to have a truth reversing negation, we would have 

def def 

to accept that -i = u and = t. We do not find this intuitive. Rather than being driven by 
a priori assumptions as to properties of negations and orders, our choice reflects our intuitions. 

□ 

DEFINITION 5. — By an interpretation we mean any set of ground literals. The truth value 
of a literal i in interpretation X, denoted by T(t), is the value defined as follows: 

t if£el and <jL 1 

dcf J i ifi&T and € I 

u if I inland (-it) I 

f if I <jt 1 and (-4) G X. 

The definition of interpretation is extended for formulas built from literals using V, A, -> and 
— > according to Table 1. 

Observe that in our approach we deal with direct contradictions, but also with more gen- 
eral notion of inconsistency. Namely, as discussed in [CM02], 

"the contradictoriness of a given theory/logic was to be identified with the fact 
that it derives at least some pairs of formulas of the form A and -^A, while incon- 
sistency was usually talked about as a model-theoretic property to be guaranteed 
so that our theories can make sense and talk about 'real existing structures' [...]". 

Inconsistencies may appear in our approach not only due to literals contradicting each 
other. Heads of rules may be assigned the value i by applications of rules with inconsis- 
tent bodies. In the rest of the paper we then prefer to talk about more general concept of 
inconsistencies rather than limited only to contradictions. 



I(£) 



3.3. Declarative Semantics of 4QL 



The declarative semantics of 4QL is defined in terms of Herbrand models. In more tra- 
ditional approaches Herbrand models are intensively studied (see, e.g., [CGT89]). Our ap- 
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proach is based on well-supported models. Since for any set of 4QL rules there is a unique 
such model (see Theorem 15), it provides truth values for all formulas. Note that interpreta- 
tions we deal with may contain negative literals. Therefore we assume that Herbrand bases 
consist of all (positive and negative) ground literals. 

DEFINITION 6. — A set of literals X is a model of a set of rules S, denoted by X (= S, iff 
for each rule g 6 S we have that X(body(g) — > head(g)*) = t, where it is assumed that the 
empty body takes the value t in any interpretation. 

It should be noticed that the Herbrand base is a model of any set of rules. However, our 
intuition is that the knowledge represented by a set of rules should be based on the explicit 
knowledge represented by facts. Minimal models, if exist, may not fulfill this requirement, as 
shown in the following example. 

Example 7. — Let S be the following set of rules: 



wait :- overloaded V restjtime . (2) 

rest_time :- wait . (3) 

-i overloaded :- rest_time . (4) 

overloaded . (5) 



Observe that: 

- by (5), overloaded G X 

- now, by (2), wait G X thus, by (3), rest_time G X 

- therefore, by (4), -i overloaded G X. 

Thus in every model of S, overloaded has the value t. 

A minimal model of S is X m i n = {overloaded, -^overloaded, wait, rest_time} but the 
only fact of S (i.e., overloaded) has in this model value i so there are no facts supporting the 
truth of wait and restjtime in this model. The intuitively correct model for S is 

X = {overloaded, -^overloaded, wait, ->wait, restjtime, -restjtime). 

Namely, wait obtains the value i to satisfy (2). Then, to satisfy (3), restjtime obtains the 
value t. □ 

The following definitions reflect our intuitions. Note that well-supportedness closest to 
ours is that of [Fag94]. However, [Fag94] concerns the classical two-valued setting. 

DEFINITION 8. — Let X be an interpretation and -< be a strict partial order on X. Given 
a set of rules S, we say that a model X of S supports a rule g G S w.r.t. -< provided that: 

body(g) — or there is Pj{g) such that X(f3j{g)) = t 

and for all literals i G @j (g) we have that i -< head(g). 
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DEFINITION 9. — A model X of a set of rules S is well-supported provided that there exists 
a strict partial order -< onX such that for every literal I 6 X, 

- ifX(£) = t then X supports rule(£) w.r.t. -<. (6) 

- ifX(£) = t then (at least) one of the following conditions hold: 

-X supports rule(£) w.r.t. -< (7) 

- there is a rule g € {rule(£), rule(-<£)} with X(body(g)) = i 
for which there is (3j(g) with X(fij (g)) = i such that (8) 
for all literals i € Pj{g), I -< head(g). 

Remark 10. — In the conditions (7)-(8) one could also expect a clause concerning 
X(-i£) — i On the other hand, X(£) — i implies that also X(-^£) = i so the respective 
condition for X(-^£) = i is already included in Definition 9. For example, the interpretation 
{rest, -irest, overloaded, -^overloaded} is a well-supported model for the set of rules: 

-i rest :- overloaded. (9) 
rest. (10) 
overloaded. (11) 
-i overloaded. (12) 
as well as for the set consisting of the fact 
->rest. 

together with rules ( 1 0), ( 1 1 ), ( 1 2). □ 
Example 11 (Example 7 continued). — The minimal model 

T-rnin = {overloaded, ^overloaded, wait, rest_time} 

of the set of rules considered in Example 7 is not well-supported. Namely, X m i n (wait) = t. 
According to Definition 9, there should be an order -< such that X m i n supports rule (2). By 
Definition 8, 

rest_time -< wait. (13) 
Since X m i n (rest_time) — t, a similar reasoning show that we should have 

wait -< restjtime, 

which together with (13) cannot hold, since -< is required to be a strict partial order. □ 



4. Properties of Well-supported Models 



The following lemma explains the status of heads of rules supported by a given interpre- 
tation. 
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LEMMA 12. — Let S be a set of rules and I be a model of S. If I supports a rule g G S 
w.r.t. a strict partial order -< then X(head{g)) G {t, i}. 

Proof — Assume that I supports g G S w.r.t. -<. By Definition 8, body(g) = or there is 
/3j(g) such that T(/3j (g)) = t and for all literals i 6 /^-(g) we have that i ~< head(g). 

In the first case body(g) — 0, so \£ :- .] G 5. By Definition 6, the empty body is t in 
any interpretation, so the rule is satisfied in 2" only when 2(head(g)) G {t, i}. 

In the second case there is j3j(g) such that I(f3j(g)) = t. Similarly to the previous case 
we have that g is satisfied in I only when I(head(g)) G {t, t}. ■ 

The next two lemmas (13 and 14) are used in the proof of Theorem 15 claiming the 
uniqueness of well-supported models. 

LEMMA 13. — Let M\ and M2 be well-supported models for a set of formulas S. Then, 
for every literal £, we have that M \ {£) = t implies M2 it) G {t, t}. 

Proof — Since Mi is a well-supported model, there exists a strict partial order -< on Mi 
satisfying conditions of Definition 9. Thus, for every literal I G Mi, whenever Mi(t) = t, 
there is a rule g G S with head(g) = £ such that A^i supports g w.r.t. -<. 

We prove the lemma by induction on -< . 

- body(g) = implies that the rule [£ :- .] is in S. The empty body is t in any interpreta- 
tion, so the rule is satisfied in A^2 only when A^f^) G {t, £}, thus the implication is trivially 
true 

- there is f3j(g) such that A^i(/3 J (g)) = t and for all literals i occurring in f3j(g) we have 
that i -< head(g). Since (3j(g) is a conjunction of literals, M.i(/3j(g)) = t implies that for 
all i in f3j(g), Mi(i) = t. By inductive assumption, for all i in (3j(g), A4a(i) G {t, t}, so 
also M.2 (Pj{g)) € {t, i}- Of course, M2 has to satisfy rule g, so for its head I we have that 

M 2 (£) e{t,i}. ■ 

LEMMA 14. — Let M\ and M2 be well-supported models for a set of formulas S. Then, 
for every literal I, we have that M\{i) = t implies M%{() G {t, t}. 

Proof — Since Mi is a well-supported model, there exists a strict partial order -< on Mi 
satisfying conditions of Definition 9. We proceed by induction on -<. 

For every literal £ G Mi, if Mi(£) = t then: 

- there is a rule g with head(g) = I such that Mi supports g w.r.t. -<; or 

- there is a rule g G S with head(g) G {£,^£} such that there is /3j(g) satisfying 
Mi(/3j(g)) = t and for all literals 1 in Pj(g), 1 ~< head(g). 

In the first case we have that Mi supports g w.r.t. -<. If body(g) = then the rule [£ :- .] 
is in S, so M2(£) G {t, t}. Consider the case when there is (3j(g) such that Mi((3j (g)) = t 
and for all literals 1 G fij{g) we have that M\(i) = t and 1 -< head(g). Then, by inductive 
assumption, we obtain that for such literals M2(i) G {t, £}. In consequence, M2{fij{g)) G 
{t,t}. Since I = head(g), we have that M2W G {t,t}. 

It remains to consider the case when there is a rule g G S with head(g) G {£, ~^£} for 
which there is fij{g) satisfying Mi(f3j(g)) = i and for all literals 1 in /3j(g), 
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I -< head(g). For all such literals, € {t, i} and at least one of them takes the value i. If 

A^i(j) =t then by Lemma 13, G {t, t}. If M\{i) =1 then by the inductive assumption 

M%{%) e {t, i}. Thus MaC9j(e)) e {t, i}, so _m 2 (£) e {t, i}. m 

THEOREM 15. — For any set S of rules there is the unique well-supported model for S. 

Proof — The fact that for every set of rules there is a well-supported model follows from 
Theorem 28. So here it suffices to prove that, given a set of rules S, there may not be two 
different well-supported models for S. Suppose the contrary, i.e., there are M.\ ^ M2 which 
are well-supported models of S. This means that there is a literal, say I, such that it is in one 
of the models and is not in the other. Without loss of generality we can assume that I G M.\ 
and I £ M 2 - 

Since I e Mi, we have that Mi(£) G {t, t}. By Lemmas 13 and 14, 
•Ma(0 G {t, t}, so I G M2 and a contradiction is reached. ■ 



5 . Computing the Unique Well-Supported Model 

Let us now present an algorithm for computing the unique well-supported model for 
a given set of rules. 

Definition 16. — Let S be a set of rules. 

- By C(S) we denote the set of relation symbols appearing in S. 

- By a duplicate of a relation symbol I G C(S) we understand a fresh relation symbol, 
for simplicity denoted by £'. 

- By C \S) we understand the set of duplicates of relation symbols of C(S), i.e., C'(S) = 
{t'\£€ C(S)}. 

- By Pos(S) we understand the DATALOG program obtained from S by replacing each 
negative literal ->£ of S by its duplicate £'. 

The algorithm is shown in Figure 1 . The following example illustrates its execution. 

EXAMPLE 17. — To illustrate the algorithm given in Figure 1, consider set of rules dis- 



cussed in Example 7 together with rules: 

good_mood :- rested V success . (17) 

-•rested :- -irestjbime . (18) 

rested. (19) 

success . (20) 



Phase 1 gives if = {overloaded, ^overloaded}. 
Phase 2 gives the following set of rules S": 
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- Input: a set of rules S 

- Output: the unique well-supported model 1° lor S . 




1) Phase 1 (finding basic inconsistencies): 

a) compute the least Herbrand model if of Pos(S) 

b) let If = {f, | £,£' Gig} 

2) Phase 2 (finding potentially true literals): 

a) let S" = {<? | q e S and If (head(g)) ^ t} 

b) set if to be the least Herbrand model for Pos(S') 
with literals £' substituted by -JL 

j j riiasc j yf t-LiciUiiLiiK Willi iiii,ufijijit,iicy j. 

a) define the following transformation Q s on interpretations: 




(b S IT\ — T 1 1 / ^ — 1 fhprp is n nilp ■ R, \/ u fl ]c(J 

Mr I 1 — i U H, t 111C1C la a 1 U1C l-C. — jj\ V ... V ^777! t O 


(, if ; 


such that 3k e {1, . . . ,m}[I(/3fe) = t] 


(15) 


and^Bn e{l,...,m}[ (If -I)( ( S„)=t]}. 


(16) 


The transformation $ s is mono tonic (see Lemma 21). 

Denote by if the fixpoint of $ s obtained by iterating <!> s on if, i.e., 




z? = U(*W) 




b) set I s =lf Ulf. 





Figure 1. 77ie method of computing the well-supported model for the set of rules S. 



wait :- overloaded V rest_time . 
rest_time :- wait . 
good_mood :- rested V success . 
— 'rested :- ->rest_time . 
rested . 
success . 

The resulting set if is {success, rested, goodjmood} . 

Phase 3 gives the following iterations of $ s : 

{overloaded, -^overloaded} 
{overloaded, -^overloaded, wait, ->wait} 

{overloaded, -^overloaded, wait, ->wait, rest_time, ->rest_time} 
{overloaded, -^overloaded, wait, ->wait, rest_time, ->rest_time, 
rested, ^rested} — fixpoint. 
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Hence X§ 



{overloaded, -^overloaded, wait, ^wait, rest_time, ->rest_time 
rested, -^rested}. 



Finally, I s = { success, goodjmood, overloaded, -^overloaded, wait, -iwait, 

rest_time, ->rest_time, rested, ->rested}, 
which is the desired result. Namely, as discussed in Example 7, 

overloaded, -^overloaded, wait, ->wait, rest_time, ->rest_time G I. 

By (20), success G I so, by (17), goodjmood G X. Finally, to satisfy (18), both 
rest, —>rest G I. □ 

In the following lemmas and theorem we assume that S is an arbitrary (finite) set of rules 
and use notation as in the algorithm shown in Figure 1 . 

LEMMA 18. — If £ G if then the value of t is t in every model of S. 

PROOF — Follows directly from the fact that Xq is the least Herbrand model for Pos(S). ■ 

Lemma 19. — If for a well-supported model M of S we have M(£) = t then £ G if. 

Proof — According to Definition 9, if M. is a well-supported model then there is a strict 
partial order -< such that M. (£) = t implies that there is a rule 

\i :- 61 V . . . V b m ] G S 

and 1 < I < m such that 6; = b\\ , . . . , bu t , I(bi ) = t and for 1 < j < ii we have that bij -< I. 
This order shows a "computation" of S' forcing I to be in if, being the least Herbrand model 
for Pos{S') (with literals £' substituted by -*£). ■ 

LEMMA 20. — For any literal £, we have that if (£) 7^ t. 

Proof — Suppose that if (£) = I. Then, by the construction of 5" in Phase 2(a) of the 
algorithm shown in Figure 1, if (£) = t. In such a case, again by the construction of S", 
the rule making £ inconsistent is removed, so no rule with £ in its head appears in S' . Thus 
/f {£) ^ t and a contradiction is reached. ■ 

LEMMA 21. — The transformation $ s is monotonic w.r.t. set inclusion G, i.e., 

XCJ implies $ S (I) C <I> S (J). 



Proof — Suppose I C J and ^(I) % $ 5 (J r ). Then there is a literal £ such that I e 
$ S (I) and £ <J> 5 (J). Observe that £ I, for otherwise £ G J, so also £ G $ 5 ( J). Since 
£ G $ S (I) and t£X, there is a rule [£ :- 61 V . . . V 6 m ] G 5 such that: 



3k€{l,...,m}[X{b k )=i] 

e {!,..., m}[ (If -!)(&„)= t]. 



(21) 



(22) 



By (21) and the assumption that I G J, we have 



3fee {!,..., m}[J r (& fe )=(]. 



(23) 
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Suppose that 3n £ {1, . . . , m}[ (if — J)(b n ) = t]. By the assumption! C J we have that 
if - J C 1$ -1, so (If - J r )(6„) = t implies that (if - !)(&„) 6 {t,t}. However, by 
Lemma 20, for any literal I, we have that if (£) ^ t, so (if — I)(b n ) ^ i and we conclude 
that (if — X){b n ) = t, which contradicts (22). Therefore, 

^3n £ {1, • . . ,m}[ (If - J)(b n ) = t]. (24) 

By the definition of Q s and properties (23), (24) we have that £, —JL £ <I> S ( J), which contra- 
dicts the assumption that I g $ s (J r ), i.e., that $ S (I) % § s {3). ■ 

By the construction of if, we have the following obvious proposition. 
Proposition 22. — For any £ £ if we have if (£) = t. 
LEMMA 23. — The interpretation I s of S is a model for S. 

Proof — Suppose I s is not a model. Then there is a rule [£ :- b\ V . . . V 6 m ] £ S such that 
either 

(i) I s (&! V ... V b m ) = t and I s (^) £ {u,f}, or 

(ii) I s (6! V... V6 m ) = tandI s W G {u,f,t}. 

Case (i) cannot hold since in Phase 2, the algorithm would assign t to I in if and if C I s (£) . 

Case (ii) assumes that I s (pi V ... V b m ) = i so the considered rule does not participate 
in Phase 2 of the algorithm. It is the only rule with I in the head. Therefore, the rule cannot 
be assigned t, so by Definition 6 and semantics of implication given in Table 1, the rule is 
assigned f. 

Thus both cases, (i) and (ii), lead to contradiction. ■ 

In the rest of this section we prove that the interpretations obtained by our method are 
well-supported models. 

DEFINITION 24. — Let S be a set of rules. We define a family {L^ \ j £ w} of disjoint 
subsets of the Herbrand base of S as follows: 

- Lq is the set of facts in S (25) 

- L>f +1 is the set of all literals I such that rule(£) has a body component 
(i r (rule(l)) consisting exclusively of literals in if. (26) 

k<i 

Note that rule(£) considered in the above definition may have several body components 

f3 n (rule(£)) satisfying (26). 

To show that I constructed by our algorithm is well-supported we need a strict partial 
order on literals of a given set S satisfying conditions of Definition 9. The required order, 
denoted by -< s , is defined below. 
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DEFINITION 25. — The order -<- s is defined as the transitive closure of the following rela- 
tion < s : 

£' < s i iff I £ Lf +1 for some i > and I' occurs in some (3 n {rule{£)) 

consisting exclusively of literals in Lf . (27) 

k<i 

EXAMPLE 26. — Consider the Herbrand base of the set of rules in Example 7. We get 

Lq = {overloaded}, Lf = {wait}, if = {rest_time}, L§ = {^overloaded} and the 
defined strict order is overloaded-^ wait-$ s rest_time-$ s ^overloaded. Note that: 

- in the case of the minimal model {overloaded, -^overloaded, wait, rest_time} the 
above order does not satisfy condition (6) of Definition 9 since in the rule (2) there should be 
a body component which obtains the value t and consisting of literals smaller w.r.t. -< s than 
wait. The only candidate is rest_time but rest_time -ft wait 

- in the case of the intuitively correct model considered in Example 7, 
{overloaded, -ioverloaded, wait, -iwait,rest_time,-irest_time}, the above order 
satisfies conditions of Definition 9 and the model is well-supported. □ 

PROPOSITION 27. — The relation (see Definition 25) is a strict partial order. 

PROOF — As the sets if (I) are disjoint, it follows by (27) that the relation -< s is irrefiexive 
and asymmetric. Consequently, its transitive closure -<- s is a strict partial order. ■ 

We can now formulate and prove the main result. 

THEOREM 28. — For any set of rules S, its model I s constructed by algorithm given in 
Figure 1 is well-supported. 

Proof — By Proposition 27, -tp is a strict partial order. We shall show that it satisfies 
conditions of Definition 9 with -< replaced by -<• . 

We have two cases: 

The case when I s (£) = t. 

Note that X s (£) = t only when I £ if - if. Since if is the least Herbrand model for 
Pos(S') with literals £' substituted by ->£, the rule rule{£) has a body component evaluated 
to t, with all literals smaller w.r.t. -<r than £, which has not been changed by iterating $ in 
Phase 3. This shows condition (6) of Definition 9. 

The case when I s (£) = i. 

If I s (£) = i then £ e if or i £ if - if. 

Consider first the case when £ £ if . The interpretation of £ £ if is obtained from the 
least Herbrand model of Pos(S). Therefore £ and -*£: 

(a) both satisfy condition (7) (when they both have been derived using rules with body 
components evaluated to t); or 

(b) both satisfy condition (8) (when they both have been derived using rules with body 
components evaluated to t); or 
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(c) one of them satisfies condition (7) and the other satisfies condition (8). 

In the case (a) we have that I satisfies (7). In cases (b) and (c), I satisfies (8). 

It remains to consider the case when £ G Zjf — if , i.e., when £ has been added to if 
by iterating $ in Phase 3. This happens when the body of rule(£) (or rule(-^£)) became 
evaluated to i. There are two cases: 

- I G if: a body component f3i(rule(£)) (or f3i(rule(-^£))) has been t and all literals 
in that component have been smaller w.r.t. -<- s than I. This j3i(rule(£)) (or f3i(rule(-^£))) 
satisfies the condition (7) of Definition 9; or 

- I G" if: a body component f3i(rule(£)) (or f3i(rule{-^£))) has been i and all literals 
in that component have been smaller w.r.t. -<- s than £. This (3i(rule(£)) (or (3i(rule(^£))) 
satisfies the condition (8) of Definition 9. ■ 

Remark 29. — Observe that Theorem 28 provides a PTlME method for verifying whether 
a given model is well-supported since -<- s can be constructed in deterministic polynomial 
time. Definition 9 does not provide a direct PTlME method due to existential quantification 
over -< 5 . □ 



6. Layered Architecture 

In this section we introduce and discuss external literals allowing us to express non- 
monotonic rules. The idea is similar to stratification but the problem is not with negation 
but with external literals. In this section we use well-known techniques related to stratified 
Datalog" 1 . All necessary definitions and theorems related to stratifications can be found, 
e.g., in [AHV96]. External literals have also been considered in [VMS09]. 

The architecture discussed in this section is equivalent to the modular architecture of [MS 11]. 
However, it simplifies the proof of Lemma 37 and also provides intuitions related to well- 
known stratified programs. 

Let M, K be disjoint sets. In what follows we assume that all relation symbols are of the 
form M.R, where M G M and i? G K. Intuitively, members of M are names of "modules" 
or "services" and members of R are classical relation symbols. 

DEFINITION 30. — An external literal is an expression of one of the forms: 

M.R, ->M.R, M.R in T, ^M.R in T, 

where: 

- M G M is called the reference module of the external literal; 
-TC{t, f, i, u} (ifT = then £ IN T is f). 

We write £ = v to stand for £ IN {v}. For an extended literal £, by module(£) we denote its 
reference module. 

The literal -^M.R IN T is to be read as "(-.M.i?) IN T" rather than "-.(A/.i? IN T)". 

DEFINITION 31. — By an extended rule we understand a rule g of the form (1), where rela- 
tion symbols are replaced by external literals assuming that head(g) is of the form M.R or 
-•M.R. 
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The following example shows a possible use of external literals. 

Example 32. — The following rules locally close loc, where loc(X, Y, T) means that ob- 
ject X has location Y at timepoint T: 

K.loc(X, Y, T) :- L.nextTime{T 1 S), — T is the timepoint next to S 

L.house(X), — X is a house 

L.loc(X, Y, S), — location of X at time S is Y 

L.chLoc(X, S) IN {u, f}. — location change of X is U or f. 

Intuitively, the above rule states that houses do not change their location no matter whether 
M's database contains information as to the change of location or not. □ 

DEFINITION 33. — A set of rules S is well-layered iff there is a mapping 
Ks '■ M — > U) such that for every rule q E S, 

- if t is a literal of the form M.R or —\M.R appearing in body(g), 

Ks(rnodule(head(g))) > Ks(module(£)) 

- if £ is a literal of the form M.R IN T, ->M.R IN T appearing in body(g), 

Ks(rnodule(head(g))) > Ks(module(£)). 

By an immediate adaptation of the algorithm for checking stratifiability of a set of rules 
of DATALOG^ (see, e.g., [AHV96]), we have the following proposition. 

PROPOSITION 34. — Checking whether a set of rules S is well-layered takes time polyno- 
mial in the size of S. 

We now have the following definition. 

DEFINITION 35. — An extended 4QL program is any finite well-layered set of extended 
rules. 



7. Complexity Issues 

7.1. Expressing Stratified Datalog^ 

Let P be a stratified DATALOG^ program. Let {1, 2, . . . , n} be all strata of P. First, we 
replace all rules with the same head £ as a single rule whose body is the disjunction of all 
bodies of rules with head I. This replacement is correct, since here we deal with the standard 
two-valued semantics for stratified DATALOG^ programs. 

For each stratum i = 1, . . . , n of P we take distinct symbols Mi, JV* <E M and replace the 
stratum by: 

- rules obtained from rules appearing in i-th stratum of P by replacing each relation 
symbol R by: 

- M{ .R if R is defined in stratum i 

- Nj .R if R is defined in stratum j with j < i 
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- for each relation R defined in stratum i we add the following rules, closing the relation 
R as the Closed World Assumption CWA does: 

Ni.R :-Mi.R = t. 
->Ni.R :- Mi.R in {f,u}. 

The result of p appearing in z-th stratum of P is given by Ni(p). 

Observe that the resulting set of rules is an extended 4QL program which can never lead 
to inconsistency. 

Example 36. — Consider a stratified program, consisting of three strata, shown in the first 
column of Table 2. Using the method described above we obtain the equivalent 4QL program 
shown in the second column of Table 2, where: 



- N\.r provides the value of r 

- N 2 .s, N 2 .q provide values respectively for s and q 

- N 3 .p provides the value of p. 

Table 2. A stratified program and its 4QL representation. 



□ 



p 


-->q. 


- stratum 3 


p 


- r. 


- stratum 3 


q 


- r. 


- stratum 2 > 


s 


-q- 


- stratum 2 


r. 




- stratum 1 



N 3 .p :- 


M 3 .p = t. 


- CWA(p) 


^N 3 .p 


-M 3 .p IN {f,u}. 


- CWA(p) 


M 3 .p :- 


- ^N 2 .q V Ni.r. 


- layer 3 


N 2 .q :- 


M 2 .q = t. 


-CWA(ty) 


~^N 2 .q 


-M 2 .q IN {f,u}. 


- CWA(g) 


N 2 .s :- 


M 2 .s = t. 


- CWA(s) 


->N 2 .s 


-M 2 .p IN {f,u}. 


- CWA(s) 


M 2 .q :- 


-N-L.r. 


- layer 2 


M 2 .s :- 


-M 2 .q. 


- layer 2 


N-L.r :- 


M 1 .r = t. 


- CWA(r) 


->Ni.r 


- M\.r in {f,u}. 


- CWA(r) 


Mi.r. 




- layer 1 



Due to the above construction we have the following lemma. 

LEMMA 37. — Every stratified DATALOG^ program can be expressed by an extended 4QL 
program. 



7.2. Complexity of Layered 4QL 

First observe that Algorithm provided in Figure 1 involves standard Datalog compu- 
tations in Phase 1 and Phase 2 and a fixpoint computation in Phase 3. Such computations 
have PTlME complexity (see, e.g., [AHV96]). Moreover, extended 4QL programs can be 
constructed layer by layer, starting from the lowest layer. Therefore we have the following 
theorem. 

THEOREM 38. — 4QL with modules has PTlME data complexity. 
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Since stratified Datalog captures PTlME on ordered structures (see, e.g., Theorem 15.4.8 
in [AHV96]), by Lemma 37 we have that extended 4QL captures PTlME queries. 

THEOREM 39. — Extended 4QL captures PTlME queries on ordered structures. 

8. Conclusions 

In the paper we have investigated the query language 4QL originally outlined in [MSI 1]. 
We defined a semantics based on the notion of well-supported model and we proposed an 
algorithm for computing well-supported models. The language is simple yet powerful. 

We focussed on logical foundations and complexity of 4QL, proving the correctness of 
the algorithm for computing well-supported models and showing that 4QL has PTlME data 
complexity and captures PTlME. 

In summary, 

- 4QL provides a very flexible mechanism for dealing with the lack of knowledge and 
resolving possible inconsistencies in an application dependent manner 

- 4QL is powerful enough to express large classes of nonmonotonic rules known from 
the literature (see [MSI 1]) 

- 4QL can be used as a rule language for Semantic Web and robotics applications. 

There are still interesting questions concerning 4QL. Apart from implementing and ap- 
plying 4QL, there are still many theoretical issues. Perhaps the most important are: 

- provide a top-down method for query answering running in deterministic polynomial 
time 

- extend the method to the case of infinite domains 

- provide optimization techniques improving the performance of query answering, not 
necessarily by computing the whole well-supported model 

- provide techniques for computing the well-supported model after a database update, on 
the basis of the model computed before the update. 
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